Exploring the Cooccurrence Patterns of Multiple Sets of Genomic Intervals

نویسندگان

  • Hao Wu
  • Zhaohui S. Qin
چکیده

BACKGROUND Exploring the spatial relationship of different genomic features has been of great interest since the early days of genomic research. The relationship sometimes provides useful information for understanding certain biological processes. Recent advances in high-throughput technologies such as ChIP-seq produce large amount of data in the form of genomic intervals. Most of the existing methods for assessing spatial relationships among the intervals are designed for pairwise comparison and cannot be easily scaled up. RESULTS We present a statistical method and software tool to characterize the cooccurrence patterns of multiple sets of genomic intervals. The occurrences of genomic intervals are described by a simple finite mixture model, where each component represents a distinct cooccurrence pattern. The model parameters are estimated via an EM algorithm and can be viewed as sufficient statistics of the cooccurrence patterns. Simulation and real data results show that the model can accurately capture the patterns and provide biologically meaningful results. The method is implemented in a freely available R package giClust. CONCLUSIONS The method and the software provide a convenient way for biologists to explore the cooccurrence patterns among a relatively large number of sets of genomic intervals.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploring the Patterns of In-Between Spaces in Guilan Historical Houses

The aim of this paper is to explain the spatial patterns of in-between spaces in Guilan historical houses in order to show their potential capacity in having various functions, and thus different forms, in the course of history. In-between spaces are mediators between two other spaces making them accessible or visible for each other. An explanation of their spatial patterns can both reveal the ...

متن کامل

The effects of different rest interval durations between resistance exercise sets on gene expression of CGRP and IGF-1 of muscle in male wistar rats

Determining the best rest interval durations between resistance exercise sets for adaptation is very important. This study investigated the effect of different rest intervals duration between resistance exercise (RE) sets on the gene expression of CGRP and IGF-1. Forty two male Wistar rats were randomly divided in to 7 groups. The resistance exercise included one session of climbing on one mete...

متن کامل

APPLICATION OF 5.8 S GENE AND ITS, PCR-RFLP PATTERNS IN TAXONOMY OF NEOTYPHODIUM ENDOPHYTIC FUNGI

Endophytic fungi have mutualistic relationship with the plant family Poaceae. These fungi confer characteristics such as yield increase and biotic and abiotic stress resistance to host plants. Endophytes are classified in the familyClavicipitaceae. The endophytes spend all their life cycle in the aerial parts of plant hosts and live intercellularly. In the present investigation, endophytic fung...

متن کامل

A Framework for Exploring the Frequent Patterns based on Activities Sequence

In recent years, the development of the use of location-based tools has made it possible to produce geometric trajectories from the user's movement paths. In this way, users' goal of traveling and related activities can be considered in addition to the geometry and route shape. the user activity trajectory represents the sequence of the visited activities and its related analysis as presented i...

متن کامل

A committee machine approach for predicting permeability from well log data: a case study from a heterogeneous carbonate reservoir, Balal oil Field, Persian Gulf

Permeability prediction problem has been examined using several methods such as empirical formulas, regression analysis and intelligent systems especially neural networks and fuzzy logic. This study proposes an improved and novel model for predicting permeability from conventional well log data. The methodology is integration of empirical formulas, multiple regression and neuro-fuzzy in a commi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 2013  شماره 

صفحات  -

تاریخ انتشار 2013